Cross-Lingual Answer Validation
نویسندگان
چکیده
We describe three language-independent methods for the task of answer validation. All methods are based on a scoring mechanism that reflects the degree of similarity between the question-answer pairs and the supporting text. We evaluate the proposed methods when using various string similarity metrics, such as exact matching, Levenshtein, Jaro and Jaro-Winkler. In addition to this baseline approach, we take advantage of the multilingual QA4MRE dataset, and devise an ensemble method, which chooses the answer indicated as correct by the largest number of analyses of the individual translations. Finally, we present a language-augmented method that enriches the questions and answers with paraphrases obtained by means of machine translation. Our methods depend on parameters which we estimate using the dataset from CLEF2011. We show that all of the described approaches achieve a significant improvement over the random baseline, and that both majority voting and language augmentation lead to superior accuracy as compared with the original method.
منابع مشابه
The Bilingual System MUSCLEF at QA@CLEF 2006
This paper presents our bilingual question-answering system MUSCLEF. We underline the difficulties encountered when shifting from a mono to a cross-lingual system, then we focus on the evaluation of three modules of MUSCLEF: question analysis, answer extraction and fusion. We finally present how we re-use different modules of MUSCLEF to participate to AVE (Answer Validation Exercise).
متن کاملDocument Expansion for Cross-Lingual Passage Retrieval
This article describes the participation of the joint Elhuyar-IXA group in the ResPubliQA exercise at QA&CLEF 2010. In particular, we participated in the English–English monolingual task and in the Basque– English cross-lingual one. Our focus was threefold: (1) to check to what extent information retrieval (IR) can achieve good results in passage retrieval without question analysis and answer v...
متن کاملWikipedia and Web document based Query Translation and Expansion for Cross-language IR
a university for the world real R Introduction • Cross-lingual QA is getting more attentions because of the needs for cross-lingual information/answer seeking in the fast-changing, multilingual world of information. • For information/answer seekers, nothing is better than to be able to use their natural languages to compose their questions precisely without worrying the burden of translation.
متن کاملNiCT/ATR in NTCIR-7 CCLQA Track: Answering Complex Cross-lingual Questions
This paper describes our complex cross-lingual question answering (CCLQA) system for NTCIR-7 ACLIA track. To answer complex questions such as events, biographies, definitions, and relations, we designed two models, i.e., the centroid-vector model and the SVMbased model. In the official evaluation of the NTCIR7 CCLQA track, our SVM-based model achieved 22.11% F-score in the English-Chinese cross...
متن کاملTowards Knowledge-enriched Cross-Lingual Answer Validation
Our baseline approach from the 2012 year includes three language-independent methods for the task of answer validation. All methods are based on a scoring mechanism that reflects the degree of similarity between the question-answer pairs and the supporting text. We evaluate the proposed methods when using various string similarity metrics, such as exact matching, Levenshtein, Jaro and Jaro-Wink...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012